351 research outputs found

    Neural Novel Actor: Learning a Generalized Animatable Neural Representation for Human Actors

    Full text link
    We propose a new method for learning a generalized animatable neural human representation from a sparse set of multi-view imagery of multiple persons. The learned representation can be used to synthesize novel view images of an arbitrary person from a sparse set of cameras, and further animate them with the user's pose control. While existing methods can either generalize to new persons or synthesize animations with user control, none of them can achieve both at the same time. We attribute this accomplishment to the employment of a 3D proxy for a shared multi-person human model, and further the warping of the spaces of different poses to a shared canonical pose space, in which we learn a neural field and predict the person- and pose-dependent deformations, as well as appearance with the features extracted from input images. To cope with the complexity of the large variations in body shapes, poses, and clothing deformations, we design our neural human model with disentangled geometry and appearance. Furthermore, we utilize the image features both at the spatial point and on the surface points of the 3D proxy for predicting person- and pose-dependent properties. Experiments show that our method significantly outperforms the state-of-the-arts on both tasks. The video and code are available at https://talegqz.github.io/neural_novel_actor

    Delving into Discrete Normalizing Flows on SO(3) Manifold for Probabilistic Rotation Modeling

    Full text link
    Normalizing flows (NFs) provide a powerful tool to construct an expressive distribution by a sequence of trackable transformations of a base distribution and form a probabilistic model of underlying data. Rotation, as an important quantity in computer vision, graphics, and robotics, can exhibit many ambiguities when occlusion and symmetry occur and thus demands such probabilistic models. Though much progress has been made for NFs in Euclidean space, there are no effective normalizing flows without discontinuity or many-to-one mapping tailored for SO(3) manifold. Given the unique non-Euclidean properties of the rotation manifold, adapting the existing NFs to SO(3) manifold is non-trivial. In this paper, we propose a novel normalizing flow on SO(3) by combining a Mobius transformation-based coupling layer and a quaternion affine transformation. With our proposed rotation normalizing flows, one can not only effectively express arbitrary distributions on SO(3), but also conditionally build the target distribution given input observations. Extensive experiments show that our rotation normalizing flows significantly outperform the baselines on both unconditional and conditional tasks.Comment: CVPR 202

    Single-molecule real-time sequencing of the full-length transcriptome of Portunus pelagicus

    Get PDF
    Reconstruction and annotation of transcripts, particularly for a species without reference genome, plays a critical role in gene discovery, investigation of genomic signatures, and genome annotation in the pre-genomic era. This is the first study to use Single-molecule real-time (SMRT) sequencing for reporting the full-length transcriptome of Portunus pelagicus. Overall, 16.26 Gb of raw reads were obtained, including 7,068,387 subreads, with average length of 2,300 bp and N50 length of 3,594 bp. In total, 351,870 circular consensus sequences (CCS) reads were extracted, including 255,378 full-length non-chimeric (FLNC) reads with mean length of 3,423 bp.70,407 genes were obtained after eliminating redundant sequences, and 56,557 (80.33%) genes were annotated in at least one database, 17,267 (24.52%) genes were annotated in all of the seven databases. Further, 68,797 coding sequences (CDS) were identified, including 36,848 complete CDS. A total of 1,730 unigenes were predicted to be transcription factors (TFs). Finally, 11,894 long noncoding RNA (lncRNA) transcripts were predicted by different computational approaches and 147,262 single sequence repeat (SSR)s were obtained. The transcriptome data reported herein are bound to serve as a basis for future studies on P. pelagicus

    Learning Controllable 3D Diffusion Models from Single-view Images

    Full text link
    Diffusion models have recently become the de-facto approach for generative modeling in the 2D domain. However, extending diffusion models to 3D is challenging due to the difficulties in acquiring 3D ground truth data for training. On the other hand, 3D GANs that integrate implicit 3D representations into GANs have shown remarkable 3D-aware generation when trained only on single-view image datasets. However, 3D GANs do not provide straightforward ways to precisely control image synthesis. To address these challenges, We present Control3Diff, a 3D diffusion model that combines the strengths of diffusion models and 3D GANs for versatile, controllable 3D-aware image synthesis for single-view datasets. Control3Diff explicitly models the underlying latent distribution (optionally conditioned on external inputs), thus enabling direct control during the diffusion process. Moreover, our approach is general and applicable to any type of controlling input, allowing us to train it with the same diffusion objective without any auxiliary supervision. We validate the efficacy of Control3Diff on standard image generation benchmarks, including FFHQ, AFHQ, and ShapeNet, using various conditioning inputs such as images, sketches, and text prompts. Please see the project website (\url{https://jiataogu.me/control3diff}) for video comparisons.Comment: work in progres

    GPU-ASIFT:A Fast Fully Affine-Invariant Feature Extraction Algorithm

    Get PDF

    A novel SETD2 variant causing global development delay without overgrowth in a Chinese 3-year-old boy

    Get PDF
    Background: Luscan-Lumish syndrome is characterized by macrocephaly, postnatal overgrowth, intellectual disability (ID), developmental delay (DD), which is caused by heterozygous SETD2 (SET domain containing 2) mutations. The incidence of Luscan-Lumish syndrome is unclear. The study was conducted to provide a novel pathogenic SETD2 variant causing atypical Luscan-Lumish syndrome and review all the published SETD2 mutations and corresponding symptoms, comprehensively understanding the phenotypes and genotypes of SETD2 mutations.Methods: Peripheral blood samples of the proband and his parents were collected for next-generation sequencing including whole-exome sequencing (WES), copy number variation (CNV) detection and mitochondrial DNA sequencing. Identified variant was verified by Sanger sequencing. Conservative analysis and structural analysis were performed to investigate the effect of mutation. Public databases such as PubMed, Clinvar and Human Gene Mutation Database (HGMD) were used to collect all cases with SETD2 mutations.Results: A novel pathogenic SETD2 variant (c.5835_c.5836insAGAA, p. A1946Rfs*2) was identified in a Chinese 3-year-old boy, who had speech and motor delay without overgrowth. Conservative analysis and structural analysis showed that the novel pathogenic variant would loss the conserved domains in the C-terminal region and result in loss of function of SETD2 protein. Frameshift mutations and non-sense mutations account for 68.5% of the total 51 SETD2 point mutations, suggesting that Luscan-Lumish syndrome is likely due to loss of function of SETD2. But we failed to find an association between genotype and phenotype of SETD2 mutations.Conclusion: Our findings expand the genotype-phenotype knowledge of SETD2-associated neurological disorder and provide new evidence for further genetic counselling

    GPU-ASIFT:A Fast Fully Affine-Invariant Feature Extraction Algorithm

    Get PDF

    Improving Utility of GPU in Accelerating Industrial Applications with User-centred Automatic Code Translation

    Get PDF
    SMEs (Small and medium-sized enterprises), particularly those whose business is focused on developing innovative produces, are limited by a major bottleneck on the speed of computation in many applications. The recent developments in GPUs have been the marked increase in their versatility in many computational areas. But due to the lack of specialist GPU (Graphics processing units) programming skills, the explosion of GPU power has not been fully utilized in general SME applications by inexperienced users. Also, existing automatic CPU-to-GPU code translators are mainly designed for research purposes with poor user interface design and hard-to-use. Little attentions have been paid to the applicability, usability and learnability of these tools for normal users. In this paper, we present an online automated CPU-to-GPU source translation system, (GPSME) for inexperienced users to utilize GPU capability in accelerating general SME applications. This system designs and implements a directive programming model with new kernel generation scheme and memory management hierarchy to optimize its performance. A web-service based interface is designed for inexperienced users to easily and flexibly invoke the automatic resource translator. Our experiments with non-expert GPU users in 4 SMEs reflect that GPSME system can efficiently accelerate real-world applications with at least 4x and have a better applicability, usability and learnability than existing automatic CPU-to-GPU source translators
    corecore